13:56
2026-06-04
forum.effectivealtruism.org
artificial-intelligence
Tamper-Resistance is a Moving Target We Might Not Hit
Open-weight AI models are fundamentally harder to safeguard than closed models because adversaries can extract weights directly and retrain them to bypass embedded protections. In September 2025, the …